Protein Sequence-Structure Alignment Using 3D-HMM

نویسندگان

  • Masashi Fujita
  • Hiroyuki Toh
  • Minoru Kanehisa
چکیده

Tertiary structure of proteins provides valuable information on their biochemical functions. But experimental structure determination is still labor-intensive, in spite of technological advances. Therefore, rapid and accurate protein structure prediction method is not only interesting from a theoretical point of view but also practically useful. Currently, homology modeling, or comparative modeling, is the most reliable structure prediction method and widely used. It is applicable when structure of at least one homolog of the target protein is solved, and predicts 3D coordinates by aligning the target sequence and the template structure. It is noted that the accuracy of homology model depends largely on the accuracy of target-template alignment. But the alignment accuracy of conventional alignment algorithms declines sharply when the sequence identity between the target and the template proteins is lower than 45% [4]. Here we introduce a novel sequence-structure alignment method to improve the alignment accuracy between distantly related proteins. It is based on hidden Markov model (HMM), which have been successfully applied to many bioinformatics problems. The distinguishing feature of our method is the explicit consideration of 3D coordinates of each amino acid residue. Each state of the HMM has a coordinate in 3D space. Alexandrov and Gerstein first introduced such idea in the field of protein structure classification, and abbreviated it as 3D-HMM [1]. In this study, we adapted 3D-HMM for the protein sequence-structure alignment problem and tested its ability in alignment accuracy. These spatial coordinates allow us to introduce structure-based transition probabilities such as restriction on the distance between two consecutive Cα atoms. Moreover, an alignment path generated by 3D-HMM has explicit 3D coordinates. Thus, energy of the model can be directly calculated without additional model building procedure. This can greatly enhance the speed of model quality evaluation when we have a large number of alternative models.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Protein homology detection by HMM–HMM comparison

Motivation: Protein homology detection and sequence alignment are at the basis of protein structure prediction, function prediction and

متن کامل

HMM based approach for classifying protein structures

To understand the structure-to-function relationship, life sciences researchers and biologists need to retrieve similar structures from protein databases and classify them into the same protein fold. With the technology innovation the number of protein structures increases every day, so, retrieving structurally similar proteins using current structural alignment algorithms may take hours or eve...

متن کامل

SAM-T08, HMM-based protein structure prediction

The SAM-T08 web server is a protein structure prediction server that provides several useful intermediate results in addition to the final predicted 3D structure: three multiple sequence alignments of putative homologs using different iterated search procedures, prediction of local structure features including various backbone and burial properties, calibrated E-values for the significance of t...

متن کامل

Prediction of 3D protein Structure based on Mutation of AKAP3 and PLOD3 Gene in Case of Non-Obstructive Azoospermia

Background: The present study has been designed with the aim of evaluating A-kinase anchoring proteins 3 (AKAP3)and Procollagen-Lysine, 2-Oxoglutarate 5-Dioxygenase 3 (PLOD3) gene mutations and prediction of 3D proteinstructure for ligand binding activity in the cases of non-obstructive azoospermic male.Materials and Methods: Clinically diagnosed cases of non-obstructive azoos...

متن کامل

Alignment-free Prediction of Ribonucleases using a Computational Chemistry approach: Comparison with HMM model and Isolation from Schizosaccharomyces pombe, Prediction, and Experimental assay of a new sequence

The study of type III RNases constitutes an important area in molecular biology. It is known that the pac1 gene encodes a particular RNase III that shares low amino acid similarity with other genes despite having a double-stranded ribonuclease activity. Bioinformatics methods based on sequence alignment may fail when there is a low amino acidic identity percentage between query sequence and oth...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004